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Abstract: 

The present invention is directed to MUC1 peptide fragments and to methods of producing 
those peptide fragments. The invention is further directed to an ex vivcwnethod of producing 
a population of autologous' antigen presenting cells (APC's) and of producing genetically 
engineered APC's, which are capable of inducing effective immune responses against MUC1. 
The invention also relates to APC's, which are obtainable by these methods as well as the use 
of the above mentioned fragments and APC's in a therapeutic composition for the treatment 
of breast cancer or other MUC1 -positive carcinomas including colorectal, pancreatic and 
gas trie carcinomas . _ 1 
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Immunogenic MUC1 glycopeptides 



The present invention is directed to MUC1 peptide fragments and to methods of producing 
those peptide fragments. The invention is further directed to an ex vivo-method of producing 
a population of autologous antigen presenting cells (APC's) and of producing genetically 
engineered APC's, which are capable of inducing effective immune responses against MUC1. 
The invention also relates to APC's, which are obtainable by these methods as well as the use 
of the above mentioned fragments and APC's in a therapeutic composition for the treatment 
of breast cancer br other MUC1 -positive carcinomas including colorectal, pancreatic and 
gastric carcinomas. 

MUC1 is overexpressed in breast cancer and by many other carcinomas and the tumor- 
associated glycoform of the mucin is known to expose multiple peptide epitopes within its 
repeat domain. These immunogenic peptide epitopes make MUC1 a promising tumor antigen 
with diagnostic as well as therapeutic potential in the treatment of cancer. 

The development of effective vaccine and immunotherapies for human cancers and infectious 
agents often is dependent on the generation of protective immune responses to specific' 
domains of membrane proteins. The tandem repeat (TR) domain of the breast, pancreatic, and - 
ovarian tumor antigen, human mucin MUC1 (Barnd et.al., PNAS USA^. 86: 7159-7163, 1989; 
Jerome et at, Cancer Res., 51: 2908-2916, 1991), the principal neutralizing domain' of fflV-1 
(Javaherian et al., PNAS USA 86: 6768-6772, 1989; Javaherian et al., Science, 250: 1590- 
1593, 1990) and the proline rich neutralization domain of the feline leukemia virus external 
surface unit protein (gp-70) (Nunberg et al., PNAS, 81: 3675-3679, 1984; Elder et al., J. 
Virol., 61: 8-15, 1987; Strouss et at, J. Virol., 61: 3410-3415, 1987; Nick et al., J. Gen. 
Virol., 71: 77-83, 1990) are examples hereof. 

Regarding MUC1, humoral and cellular responses have been demonstrated in cancer patients 
(1,2), but also in pregnant woman (3) and healthy individuals (4). Although these natural 
responses are usually insufficient to fight' the progress of cancer, MUC1 -derived peptides or . 



glycopeptides axe used currently in' clinical trials to trigger therapeutically and 
prophylactically immune reactions in humans (5,6). 

There is growing evidence that triggering of efficient humoral and CTL responses, to MUC1 
needs the activation of specific T helper, cell clones, which is induced by .MHC class II- 
presented antigen fragments. The generation of MHC. class H-restricted peptide epitopes by 
antigen presenting cells (APCs) like dendritic cells (DCs) follows a multistep process starting 
with endocytosis, followed by the processing in late endosomal compartments and resulting in 
the binding of proteolytic peptide fragments to MHC class II proteins and their transport to 
the cell surface. While many aspects of this complex process have been elucidated there is 
currently little evidence on the processing and MHC class II presentation of glycosylated 
antigens, in particular of the highly O-glycosylated mucin antigens. To enable the design of 
efficient tumor vaccines on the basis of MUC1 knowledge on how DCs or other APCs deal 
with O-glycosylated peptides is of importance. One particular question in this context refers 
to the fate of complex OJinked glycans during processing, since efficient peptide 
fragmentation may afford complete or partial removal of sugars prior to proteolysis. O-linked 
glycans could also direct the processing with respect to the accessibility of cleavage sites and 
hence restrict the pattern of peptide fragments on the one hand, while 'they enrich the pattern 
of epitopes on the other. The inventors have recently shown by immunological methods that 
MUC1 core-glycans are not removed during processing and that glycopeptides presented by 
MHC class n are able to activate T cell hybridomas (unpublished). •■ 

Although several of the cathepsins have been identified as components of the processing 
machinery (8,9), it is currently not known which eri2yme(s) are involved in the processing of 
MUC1 and at which sites within the repeat domain they actually cleave the protein. 
Expectedly, there are multiple cleavage sites and only subtractions of the generated peptide 
fragments should fulfill the requirements for binding to MHC class U molecules. Thus; it 
could be a valuable approach. for the design of new, anti-cancer vaccines to identify new 
immungenic MUC1 fragments capable of binding to MHC class H molecules. 



Therefore, it is an object of the present invention to provide novel immungenic MUC1 peptide 
fragments, which can be used for immunization in humans. 
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It is a further object of the present invention to provide a method of producing an 
immunogenic MUC1 peptide, which allows the originally contained glycosylate pattern to 
be conserved during the production process. 

These objects are solved by the subject-matters of the independent claims. Preferred 
embodiments are set forth in the dependent claims. 

Surprizingly, the inventor has found out that cathepsin L or a closely related enzyme shows a 
very restricted fragmentation pattern during mouse DC processing with only one preferred 
cleavage site per MUC1 repeat. The cleavage specificity and specific inhibition of the 
protease' were in agreement with the assumption of the inventor that cathepsin L or a closely 
related enzyme (cathepsins B or S) were involved in this highly specific cleavage. 

The experimental set-up used biotinylated and non-tagged, bead-coated synthetic 
glycopeptides comprising one or more repeat units of MUC1 with single or multiple O-linked 
core-type gjycans. Exogenously administered MUC1 'peptide fragments were rapidly taken up 
by mouse dendritic cells (DCs) and a large proportion' was processed in late endosomal 
compartments during . 4h. MUCl repeat peptide derived proteolytic fragments that were ' 
identified and sequenced show that the glycans are not removed during antigen processing 
and that the presence of carbohydrates affects the cleavage sites yielding a different repertoire 
of cleaved peptides. 

Surprizingly, the proteolytic products suggest a highly specific processing of the repeat 
peptide with one preferential cleavage site at the Thr-s'er peptide bond While human 
cathepsin D was unable to cleave the MUCl repeat peptide in vitro, human cathepsin L 
digestion resulted in specific hydrolysis of the Thr-Ser peptide bond. Since MUCl sequences 
contain a VTSA motif in eveiy repeat unit, the generated fragments start with the amino acid 
sequence SAP at their N-terminus. 

Information on the structure of processed MUCl glycopeptides is of utmost importance for 
the design of tumor vaccines. Intact O-glycosylation on processed MUCl repeat peptide 
contributes to a greater variety of the -MHC class Il-restricted helper T cell responses, thereby 
enhancing an overall anti-tumor response. 



According to the invention, a MU.C1 peptide fragment of the tandem repeat domain of human 
MUCT is provided, starting with the amino acid sequence SAP and comprizing at least 9 

• amino acids. The amino acid .sequence of human MUC1 is already known and can be found, 
for example, in the SWISS PROT database. The MUC1 protein contains varying numbers of 
amino acids due to a length polymorphism resulting from individually variable repeat 

• numbers, and, in the moment, at least 9 isoforms are known (1/A, 2/B, 3/C, 4/D, 5/SEC, 6/X, 
7/Y, 8/Z and 9/S, which are produced by alternative splicing). ' • ; 

In this invention, specific fragments of MUC1 are contemplated, which are derived from the a: 
synthetic or natural MUC1 sequence, which has been cleaved enzymatically at the VTSA 
motif contained in* all MUC1 sequences (or was chemically synthesized in case of synthetic 
fragments). The fragments of the present 1 invention thus -'can be obtained by cleavage of 
MUC1 sequences with cathepsin-L. Irrespective of the starting amino acid position in the 
repeat sequence (TAP, AHG, GST) and of the length of the peptides (20mer, 21mer, 25mer, 
lOOmer), cathepsin L cleaves specifically between Thr-Ser in the VTSA motif of the repeat 
peptide, thereby, resulting in the inventive peptide fragments. It is an essential feature of the 
present invention that all fragments start with the amino acid sequence SAP at their N- 
terminus. 

I ; . . 

1 * * . 

1 * 

As mentioned above, the MUC1 peptide fragment of the present invention is not limited in its 
length, and may, for example, comprise up to 100 amino acids or even more. However, amino 
acids are preferred, wherein the MUC1 fragment is in the range of 10 to 25 amino acids. 

According to a further embodiment, the invention provides specific MUC1 peptide fragments 
which comprise the amino acids of SEQ ID NO 1-4 or variants thereof, wherein said variants 
comprise one or more insertions, substitutions and/or deletions as compared to the sequence 
of SEQ ID No. 1-4, and wherein the biological activity is substantially equal to the activity of 
the peptide comprising the unmodified amino acid sequence of SEQ ID No. 1-4, and provided 
that said variants start with the amino acid sequence SAP and comprise at least 9 amino acids. 

In this context, the present invention provides the following peptides:' 

/ 

i ... 
SAPDTRPAPGSTAPPAHGVT (SEQ ID NO 1) 
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SAPESRPAP GSTAPAAHGVT (SEQ ID NO 2) 

1 

SAPESRPAP GSTAPPAHGVT (SEQ ID NO 3) . 

SAPDTRPAPGSTAPAAHGVT (SEQ ID NO 4) 
■4 _ — ' 

The arrow indicates that the present invention also encompasses variants of the above 
mentioned amino acid sequences, which are reduced by one or more, amino acids starting 
from the C-terminus, under the proviso that the variants at least comprise the 9 N-terminal 
amino acids of the above indicated sequences (bold printed). 

■ The term biological activity as used herein is related to the immunogenic function of the 
inventive amino acid sequences. As mentioned above, MUC1 is naturally overexpressed in. 
various cancers, like breast cancer and other adenocarcinomas, , and therefore, it is an 
important target for immune based anti-cancer therapy. Thus, the MUC1 fragments, as. 
disclosed hereinbefore are contemplated as long as they are capable of inducing an. 
immungenic reaction in mammals, preferably humans, in order to initiate/promote an attack 
of the patient's immune system against the respective cancer. , 

The amino acid sequences of the present invention also encompass all sequences differing 
from the herein disclosed sequences by amino acid insertions, deletions, and substitutions. 

Preferably, amino acid "substitutions" are the result of replacing one amino acid with another 
amino acid having similar structural and/or chemical properties, i.e., conservative amino acid 
replacements. Amino- acid substitutions may be made on the basis of similarity in polarity, 
charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the ' 
residues involved/For example, nonpolar (hydrophobic) amino acids include alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar neutral amino 
acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine; 
positively charged, (basic) amino acids include arginine, lysine, and histidine; and negatively 
charged (acidic) amino acids include aspartic acid and glutamic acid. 

"Insertions" or "deletions" are typically in the range of about 1 to 3 amino acids. The variation 
allowed may be experimentally determined by systematically making insertions, deletions, or 




substitutions of amino acids in a polypeptide molecule using recombinant DNA techniques 
and assaying the resulting recombinant variants for activity. This does not require more than 
routine experiments for the skilled artisan. In case of MUC1 repeats three positions are known 
to exhibit a sequence polymorphism in the population (Engelrriann et al., 2001, J. Biol. Chem. 
276, 27764 - 27769; PCT patent application PCT/DE00/00440, the disclosure thereof being 
incorporated in its entirety in this application). 

The present invention is further directed to a nucleic acid, which encodes one of the above 
mentioned MUC1 peptide fragments. 

The term, "nucleic acid sequence" refers to a heteropolymer of nucleotides or the sequence of 
these nucleotides. The terms "nucleic acid" and "polynucleotide' 1 are used interchangeably 
herein to refer to a heteropolymer of nucleotides . 

The polynucleotides of the present invention also include, but are not limited to, a 
'polynucleotide that hybridizes to the complement of the disclosed nucleotide sequences under 
moderately stringent or stringent hybridization conditions; a polynucleotide which is an allelic 
variant of any polynucleotide recited above; a polynucleotide which encodes a species 
homologue of any of the herein disclosed proteins; or a polynucleotide that, encodes a 
polypeptide comprising an additional specific domain or truncation of the disclosed proteins. 

Stringency of hybridization, as used herein, refers to conditions under which polynucleotide 
duplexes are stable. As known to those of skill in the art, the stability of duplex is a function 
of sodium ion concentration and temperature (see, for example, Sambrook et al., Molecular 
Cloning: A Laboratory Manual 2 nd Ed. (Cold Spring Harbor Laboratory, (1989)). Stringency 
levels used to hybridize can be readily varied by those of skill in the art. 

Low stringency hybridization refers to conditions equivalent to hybridization in 10% 
formamide, 5 x Denhart's solution, 6 x SSPE, 0.2% SDS at 42°C, followed by washing in 1 x 
SSPE, 0.2% SDS, at 50°C Denhart's solution and SSPE are well known to those of skill in the 
art as are other suitable hybridization buffers. 

Moderately stringent hybridization refers to conditions that permit DNA to bind a 
■complementary nucleic acid that has about 60% identity, preferably about 75% identity, more 



preferably about 85% identity to the DNA; with greater than about 90% identity to said DNA 
being especially preferred. Preferably, moderately stringent conditions are conditions 
equivalent to hybridization in 50% formamide, 5 x Denhart's solution, 5 x SSPE, 0.2% SDS at 
42°C, followed by washing in 0.2 x SSPE, 0.2% SDS*, at 65°C. 

High stringency hybridization refers to conditions that permit hybridization of only those 
nucleic acid sequences that form stable duplex in 0.01 8M NaCl at 65 6 C. (i.e., if a duplex is 
not stable in 0.0 18M NaCl at 65 °C, it will not be stable under high stringency conditions, as 
contemplated herein). 

Further, nucleic acid hybridization techniques can be used to identify and obtain a nucleic 
acid within the scope of the invention. Briefly, any nucleic-acid having some homology to a 
sequence set forth in this invention, or fragment thereof, can' be used as a probe to identify a 
similar nucleic acid by hybridization under conditions of moderate to high stringency. Such 
similar nucleic acid then can be isolated, sequenced, and analyzed to determine whether they 
are within the scope of the invention as described herein. 

According to a preferred' embodiment, the MUC1 peptide fragments of the present invention 
are O-glycosylated at one or more of the threonines or serines . contained in the sequence. 
Preferably, the MUC1 peptide fragments of one of SEQ ID No. 1-4 are glycosylated, at Thr 5 
and/or 12. However, also all other serines or threonins in the respective sequences may be 
glycosylated. A preferred glycan used, herein is GalNAc or further complex glycans, which 
are derived therefrom. 

According to a further aspect, the present invention provides a method of producing the' 
inventive MUC1 peptide fragments, comprising the following steps: ' 

a) providing a peptide comprising the tandem repeat domain of MUC1 or a part thereof, 
which -part at least contains one repeating unit of said tandem repeat domain of MUC1, 

b) contacting the peptide of a) with an effective amount of cathepsin-L, thereby cleaving 
the peptide, and 

c) isolating the fragments produced in b). 

Preferably, the peptide provided in a) is a MUC1 protein showing a natural glycosylation 
pattern. As mentioned above, it was surprisingly found out- by the present inventor that a 
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cathepsin-L cleavage as peifonried in step b), leaves the glycosylation pattern of the MUC1 
protein* provided in a), intact. Intact O-glycosylation on processed MUC1 repeat peptides in 
turn contributes to a greater variety of the MHC class II-restricted helper T cell responses, 
thereby enhancing an overall anti-tumor response in patients. Thus, the inventive method 
leads to a MUC1 peptide fragment, which can be easily processed by the patient's APC's, for 
example dendritic cells, by the MHC class II pathway, and will be presented with an intact 
glycosylation pattern leading to an enhanced immune response of helper T-cells. In this 
context it should be noted that there is no restriction /regarding the' glycosylation pattern, 
however, threonin glycosylated at the cleavage site leads to a Thr-Ser bonding, which is stable 
to cathepsin L proteolysis. ■ ■. 

Of course, the above mentioned method is not the only one which leads to the inventive 
peptide fragments, whether glycosylated or not. It is also possible to chemically synthesize 
those fragments thereby providing, for example, a desired glycosylation pattern. To 
synthesize glycopeptides, glycosylamino acid building blocks are required which already 
contain the oligosaccharide chain and threonine or serine. The syntheses of these building 
blocks have been, described (Mathieux, N., Paulsen, H., Meldal, M., Bock, K. J.Chem.Soc., 
Perkin Trans. 7: 2359-2368, 1997). The multiple column solid phase synthesis can be carried 
out in a. semi-manual 20-column multiple synthesizer, and Wang resin can be selected as 
support material. The Wang resin (2,5 g) can for example be placed in a glass reactor, swelled 
in dichloromethane (15 cm 3 , 10 min.) and washed. A mixture of Fmoc-Ala-OH (3,40 mmol), 
l-(mesitylenesulfonyl)-3-nitro-l,2,4-triazole (3,40 mmol) and methylimidazole (3,40 mmol) 
in dichloromethane (15 cm 3 ) was added. After 2 h, the resin can be washed and the unchanged 
amino groups can be acetyl ated with Ac 2 0/DMF (1:1; 15 cm 3 ). The derivatized resin is then 
packed for the glycopeptide synthesis in the 20 columns of the synthesizer. The reaction and 
washing solvent can be DMF, the Fmoc deprotections were performed by treatment with 
piperidine (20 %) in DMF (20 min ; ). The amino acids are coupled as Fmoc amino acid Pfp 
ester with Dhbt-OHi (3 mol equiv.). The Gal(l->3)GalNAc-containing building block are 
coupled with TBTU and N-ethyldiisopropylamine (1,5 mol equiv.). After 20 h reaction time 
the synthesis cycle is repeated to complete the assembly of each glycopeptide. After removal 
of the last Fmoc groups, the resins are washed, dried, treated with 95 % aq TFA (2 cm 3 , 2 h), 
and filtered off. Then, the compounds is treated with catalytic amounts of 1 % CHsONa in 
methanol at pH 8,5 to remove the acetylic groups of the saccharide part, and purified by 
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preparative RP-flPLC The pure O-glycopeptides are 'obtained in yields of ' 16-57 % after 
lyophilizafion. • 

Preferably, glycopeptides are formed containing Clinked GalNAc or elongated complex 
glycans at one or several of the threonine or serine residues. ' .' 

Preferably, the peptide provided in step a) is represented by natural MUCl derived from 
human milk fat membranes (see Muller et al., 1997, J. Biol. Chem. 272, 24780 - 24793) from 
tumor ascites (Beatty et al., 2001, Clin. Cancer Res. 7, 781 - 787) or from human- breast 
carcinoma cell lines (Muller et al, 2002, J. Biol Chem. 277, 26103 - 261 12) or is represented 
by SEQIDNO 5, 6, 9 or 10. 

Furthermore, the inventive amino acids of the peptide provided in step a) of the above method 
of producing the inventive MUCl peptide fragments are O-glycosylated, however, provided 
that the peptide is not glycosylated at the cleaving site of cathepsin-L. Preferably, one or more 
of the threonines , or serines of the inventive peptide fragment- isolated in c) are O- 
glycosylated. . " • _ 

'* • ' •• . '• 

According to a further aspect, a MUCl peptide fragment is provided, which is obtainable by 
the above mentioned methods. 

The invention is further directed to an ex vivo-method of producing a population of 
autologous antigen presenting cells (APC's), which is capable of inducing effective immune 
responses against MUCl, comprising the steps of . 

a) contacting the autologous APC's from tumor patients with an effective amount of an 
inventive MUCl peptide fragment under conditions which allow endocytosis 
, processing and MHC class H presentation of the peptide fragments by said APC's and 
• b) 1S olating said MUCl peptide presenting APC's for the. purpose of immunotherapeutic 
application in patients. 

Preferably, the MUCl peptide fragment, in a) , are bound to coated ferric oxide beads ■ 
However, it is noted tbat all other known beads can be nsed for the pnrpose of the. above 
mentioned method Generally, all beads can be used, which are no. larger man approx. 1-2 
Mmm s,ze and allow a covalent coupling of antibodies and lectines. 



Furthermore, an ex vivo-method of producing genetically engineered APC's is provided, 
which are capable of inducing effective immune responses against MUCl, comprising the 
steps of ' 

a) providing a nucleic acid, which codes for one of the inventive MUCl peptide 
fragments; 

b) .transacting the APC's with said nucleic acid, and 

c) selecting APC's, which present said MUCl peptide fragments in a MHC H restricted 
manner. 

According to a preferred embodiment, the nucleic acid in step a) is provided in an expression 
vector. This expression vector preferably comprises one or more regulatory sequences. The 
term "expression vector" ' generally refers to a plasmid or phage or virus' or vector, for., 
expressing a polypeptide from a DNA (RNA) sequence. An expression vector can comprise a 
transcriptional unit comprising an assembly of (1) a genetic element or elements having a 
. regulatory role in gene expression, for example, promoters or enhancers, (2) a structural or 
coding sequence which is transcribed into mRNA and translated into protein, and (3) 
appropriate transcription initiation and termination sequences. Structural units intended for. 
use in yeast or eukaryotic expression systems preferably include a leader sequence enabling 
extracellular secretion of translated protein by. a host cell. Alternatively, where recombinant 
protein is expressed, without a leader or transport sequence, it may include an N-terminal 
methionine residue. This residue may or may not be subsequently cleaved from the expressed 
recombinant protein to provide a final .product. 

According to a further aspect of the invention, an APC is provided, which is obtainable by 
one of the aforementioned methods. Preferably, this APC is a dendritic celi or a B cell. 

' ■ ' » 

Furthermore, the present invention provides a therapeutic composition, comprising the 
inventive MUC 1 peptide fragment or the inventive APC's and a pharmaceutically. acceptable 
carrier. Such a composition may also contain (in addition to the ingredient and the carrier) 
diluents, fillers, salts, buffers, stabilizers, solubilizers and other materials. well known in the 
art. The.term "pharmaceutically acceptable" means a non-toxic material that does not interfere 
with the effectiveness of the biological activity of the active ingredient(s). The characteristics 
of the carrier will depend on the route of administration. The therapeutic composition may 



further contain other agents which either enhance , the activity or use in treatment. Such 
additional factors and/or agents 'may be included in the therapeutic composition to produce a 
synergistic effect or to minimize side-effects. : ; 

Techniques for formulation and administration of the compounds of the present application 
may be found in "Remington's Pharmaceutical Sciences", Mack Publishing Co,, Easton, PA, 
latest edition. \ * 

The compositions contain a therapeutically effective dose of the respective ingredient. A 
therapeutically effective dose refers to that amount of the compound sufficient, to result in 
amelioration of symptoms, e.g., treatment, healing, prevention or amelioration of such 
k conditions, specifically in an induction of an immune response in the patient. Suitable routes 
of administration may, for. example, include parenteral delivery, including intramuscular and 
subcutaneous injections; as well as intrathecal, direct intraventricular, intravenous, 
intraperitoneal injections. Intravenous administration to the patient is preferred. 

A typical composition for intravenous infusion can.be made up to contain 250 ml of sterile 
Ringer's solution,; and 10 mgof the ingredient. See Remington's Pharmaceutical Science (15 th 
Ed., Mack Publishing Company, Easton, Ps., 1980). Preferably, the therapeutic composition 
of the present invention is a vaccine. 

1 As mentioned above, this vaccine finds application for use in the treatment of breast cancer . 
or other MUC1 -positive carcinomas including colorectal, pancreatic and gastric carcinomas. 

Unless otherwise defined, all technical and scientific terms used herein have the same 
meaning as commonly understood by one. of ordinary skill in the ait to which this invention 
pertains. All publications, patent applications, patents, and other references mentioned herein 
are incorporated by reference in their entirety. In case of conflict, the present specification, 
including definitions, will control. In addition, the materials, methods, and examples are 
illustrative only and not intended to be limiting. 

The invention is now further illustrated by the accompanying drawings, which are showing 
the following: 



Fig. 1 

MUC1 glycopeptide processing by mouse dendritic ceils 

A mixture of bibtinylated glycopeptides HI to H3 (AHGVTSAPDTRPAPGSTAPPA; SEQ 
ID NO 5) and H4 to H6 (AHGVTSAPESRP APGSTAPAA; SEQ ID NO 6) corresponding to 
a partial sequence of MUC1 tandem repeats and glycosylated with GalNAc at Thr5 (HI, H4), 
ThrlO/SerlO (H2, H5) or Thrl7 (H3, H6) was used for pulsing of mouse dendritic cells D2.4. 
Processing products were affinity-isolated from cellular fractions or from culture 
supematants,- reduced with dithiothreitol to cleave the label, and analysed by reflecfron 
. MALDI mass spectrometry in the positive ion mode. 

A, centrifuged cellular fraction; 

B, magnetically separated cellular fraction; 

C, culture supernatant 

The major signals at xdx 2249.0 (HI to H3) and 2223.0 (H4. to H6) correspond to the 
precursor glycopeptides, the signals at m/z 1695.7 (PI) and 1669.7 (P2) to the SAP 16 
fragments (PI derived from HI to H3; P2 from H4 to H6). 

Fig. 2 

Peptide sequencing of processing pr oducts PI and P2 by LC-MS/MS analysis on a Qtof2 
electrospray mass spectrometer 

(Glyco)peptides were separated by nanoflow liquid chromatography on a reversed-phase 
microcapillary column and analysed online by electrospray mass spectrometry in the positive 
ion mode. B ion and y ion fragment series from the N-terminal and C-terminal sequences of 
the major peptide products (A, PI at m/z 1695; B, P2 at m/z 1669) were used.to confirm the 
sequence of SAP 16 peptides (refer to C). 

Fig. 3 • 

In vitro proteolysis of MUC1 glycopeptides by human cathepsin L 

MUC1 glycopeptides (10 ug) were treated for 3h with 2 to 5 munits of cathepsin L in 0.1 M 
sodium acetate, pH 5.5, containing I mM EDTA, and 1 mM DTT. Reflection MALDI mass 
spectra were recorded in the positive ion mode using a-cyano-4-hydroxy cinnamic acid as 
matrix.. ' 

A, Biotinyla'ted A3 glycopeptide (m/z 2784.7), A3 glycopeptide after reductive cleavage of 
the label (m/z 24 1 2. 6), SAP 1 6 fragment (m/z 1 858.3). 
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B, Products of non-tagged A3 glycopeptide: SAP 16 fragment (m/z 1858.0), APD15 (m/z 
1771.0), TSA17 (m/z 1959.0). 

i • 

Fig. 4 

Cathepsin L-like activity in low-density endosomes from mouse dendritic cells cleaves 
MUC1 repeats at Thr-Ser 

Low-density endosomes in mouse dendritic cells were separated firom lysosomes and plasma 
membranes by density gradient centrifugation in percoll/sucrose (30 ml). Fractions- (1 ml) , 
were tested for the presence of marker proteins/enzymes ^-hexosaminidase, H2 antigen) and 
for proteolytic activity. TAP25 peptide was used as substrate (20h, 37 °C, pH 5.5, 1 mM 
EDTA, J mM DTT). 

Materials and Methods 

Isolation and cultivation of dendritic cells * 
Immortalized dendritic cells (clone D2.4) from G57BL/6 mice was grown in DMEM 
supplemented with 10% FCS, L-glutamine, 0.1% 2-mercaptoethanol, and antibiotics at 37°C 
and 5% C0 2 (9). 

Generation of synthetic (biotinylated) MUC1 glycopeptides and their conjugation to magnetic 
ferric oxide beads 

Glycopeptides HI to H6 con*esponding to MUC1 tandem repeat peptides based on the 
AHG2 1 sequences AHGVTSAPDTRPAPGSTAPPA (H 1 to H3) and 
AHGVTSAPESRPAPGSTAPAA (H4 to H6) and carrying GalNAc at Thr5, Thrl-0, or Thrl7 
were chemically synthesized according to previously published protocols (10) and isolated 
successively on preparative and analytical reversed-phase columns on an HPLC workstation 
(System Gold, Beckman, Munchen, Germany). The same holds true for. the glycopeptide 
series HI 1 to H13 (GalNAc substituted) and A3 (substituted with Gal|3 1-3 GalNAc at Thrl7),, 
which are based on the same peptide sequence as HI to H3. The lOOmer ' peptide, 
corresponding to five repeats of the MUC1 domain and starting with the HGV motif was ' 
synthesized by a local facility (University of Pittsburgh) and in vitro glycosylated with 
GalNAc using purified polypeptide GalNAc-transferases-Tl and-T2 (kindly provided by Dr. 



Henrik Clausen, School of Dentistry, University of Copenhagen, Denmark) under conditions 
described previously (1 1 , 1 2). 

Glycopeptides HI to H6 (100 ug each) were biotinylated with [2-(Biotinamido)ethylamido]- 
3,3'-dithiopropiohic acid N-hydroxysuccinimide ester (100 mM in DMSO, 100 ul) at 50°C 
over a period of 48h. After evaporation of the solvent by vacuum centrifugation the 
biotinylated products were separated from non-tagged glycopeptides and excessive reagent by 
reversed-phase chromatography on a PLRP-S column. 

Anti-MUCl dynabeads were prepared by covalent coupling of 50 ug B27.29 monoclonal 
antibody to tosylated M-280 beads (Dynal, Hamburg, Germany) .in 0.1 M borate buffer, pH 
9.5 (200 ^1) for 48h at ambient temperature. Lectin-coated dynabeads were prepared similarly 
by conjugation of 50 ug Helix pomatia agglutinin to M-280 beads. Antibody- and lectin- 
coated^eads (10 8 ) were complexed with (biotinylated) glycopeptides. (50 ug) by incubation in 
250 ul AIMV medium under roUing for 2h at ambient temperature. 

Antigen pulse of dendritic cells and isolation of peptide fragments 

Mouse dendritic cells D2.4 (10 7 cells/ml) were transferred into a 15 ml Falcon tube, 
suspended in AIMV medium and preincubated for lh at 37?C (5% CO ? ). Antigens were 
added as a mixture of biotinylated glycopeptides HI to H6 (50 pig) after conjugation to anti- 
MUC1- antibody, and lectin (HPA)-coated dynabeads (each at 5 x 10 7 beads / ml final 
concentration). The 1 ml suspension was' incubated with occasional shaking at 37°C (5% 
C0 2 ) for a total time period of 4h. After pulsing the cells were separated from the medium in 
two ways: an orthogonal magnetic separation of bead-loaded cells and free dynabeads and a 
centrifugation of the remaining cells (180 g, 5 min), which had not been removed in the first 
step due to a lower antigen/bead load. Both cell fractions were washed several times in 
phosphate (4 mM), NaCl (153 mM), pH 7.2, while the cell-free supernatant was re- 
centrifuged at 3000 g (5 min, 4°C). The cell fractions were treated on ice for 15 min with 100 
ul 1% NP40, 10 mM Tris-HCl, 150 mM NaCl, pH 8.0 containing a cocktail of protease 
inhibitors (Sigma P8340, Miinchen, Germany) followed by ultrasonication for 2 min. 



In experiments where biotinylated antigens had been used the cell extracts were diluted 
twofold with PBS and incubated with 2 x 10 8 streptavidine-coated dynabeads M-270 for 30 



min at 37°C and another. 30 min period with rolling at ambient temperature. After magnetic 
separation and washing of the beads for three times the bidtinylated glycopeptides were 
cleaved-off by reduction with 10 mM dithiothTeitol at 56°C (30 min), dried by vacuum 
centrifugation and taken up in 0.1% aqueous trifluoroacetic acid. Since both cell fractions 
contained considerable proportions of antibody- and lectin-coated • beads, also non-tagged 
peptide and glycopeptide fragments were isolated during this separation step. Considerable 
amounts of non-tagged MUCl glycopeptides were also demonstrated to bind to streptavidine- 
coated dynabeads via undefined mechanisms and to elute during heating under reducing 
conditions. Peptides and glycopeptides contained in the cell-free supernatant were affinity- 
isolated by using mixtures of streptavidine-, anti-MUCl antibody-, and lectin-coated beads or 
by applying reversed-phase chromatography on 50 ul columns of Poros CI 8. After loading of 
the reversed-phase column with 500 ul of supernatant the sample was desalted by washing 
with 0.1% aqueous trifluoroacetic acid and .eluted with 80% acetonitril in 0.1% aqueous 
• -trifluoroacetic acid 

i 

Mass spectrometric analyses 

MALDImass spectrometry: The peptide and glycopeptide samples (20 ul). contained in 0.1% 
aqueous trifluoroacetic acid or in mixtures with acetonitril were, applied to the stainless steel 
target by mixing a 1 ul aliquot with the same volume of matrix (saturated solution of a- 
cyano-4-hydroxycinnamic acid in acetonitril / 0.1% TFA, 2:1). Mass spectrometric analysis 
was performed on a Bruker-Reflex IV instniment (Bruker-Daltonic, Bremen, Germany) by 
positive ion detection in the reflectron mode. Ionization of co-cristallized analytes was 
induced with a pulsed nitrogen laser beam (337 nm) and the ions were accelerated in a field of 
20 kV and reflected at 23 kV ( 1 2, 13). 

Nanqflow liquid chromatography with on-line ESI mass spectrometry: LC/MS data were 
acquired on a Q-Tof II quadrupole-time of flight mass spectrometer (Micromass, Manchester, 
UK) equipped with a Z spray source. Samples were introduced using the Ultimate nano-LC 
system (LC Packings, Amsterdam, Netherlands) equipped with the Famos autosampler and 
the Switchos column, switching module. The column setup comprised a 0,3 mm x 1 mm trap 
column and a 0,075 x 150 mm analytical column, both packed with 3 um PepMap CI 8 (LC 
Packings, Amsterdam, Netherlands). Samples were diluted 1:10 in 0,1 % TFA. 10 ul were • 
injected onto the trap column and. desalted for 3 min using 0,1 % TFA and a flow rate of 30 



ul/min. The 10 port valve switched the trap column into the analytical flowpath and the 
peptides were eluted onto the" analytical column using, a gradient of 5 % ACN in 0, 1 % formic 
acid to 40 % ACN in formic acid over 20 min and a column flow rate of approximately 
200 nl/min, resulting from a 1: 1000 split of the 200 ul/niin flow delivered by the pump. The 
ESI interface comprised, a metal coated PicoTip spray emitter (New Objective, MA) mounted 
onto the PicoTip holder assembly (New Objective). Stable nanospray was established by the 
application of 2,5 - 3,0 kV to the distal end of the PicoTip and a nitrogen counter flow rate of 
approximately 40 L/min. Data dependent acquisition of MS and MS/MS spectra was 
controlled by the masslynx software. Survey scans of 1 sec covered the range from m/z 400 to 
m/z 1200. Doubly and triply charged ions rising above a given threshold were selected for 
MS/MS experiments. In MS/MS mode the mass range from m/z 40 to m/z 1400 was scanned 
in 1 sec and 10 scans were added up for each experiment. Micromass formated peaklists were 
generated from the raw data using the Proteinlynx software module. 

Confqcql Laser Scanning Microscopy and fluorescence-activated cell sorting 

Antigen uptake was quantitated by flow cytometric analysis using "a Becton Dickinson 
FACScalibur according tp a previously published protokol (14). Prior to microscopic 
inspection DCs were fixed with 2% formaldehyde, and permeabijized with 0.1% saponin. 
Following staining with anti-MUCl antibody (B27.29, Biomira,. Edmonton, Canada), 
biotinylated secondary anti-mouse Ig (Dako, Hamburg, Germany) and FITC-labelled- 
streptavidine (Sigma), the cells were fixed a second time with 1% paraformaldehyde, the 
chambers of the slides were removed, and the slides were mounted for the analysis by 
confocal- laser scanning microscopy on a Leica DM IRE2 (14). 



In vitro proteolysis of MUC1 (glycp)peptides with human cathepsins and. low-density 
endosomal fractions from mouse dendritic cells 

Human cathepsin L and D were purchased from Sigma (Miinchen, Germany) and solubilized 
in 0.1- M sodium acetate buffer, pH 5.5, containing 1 mM EDTA (cathepsin D) and 1 mM 
dithiotreitol (cathepsin L). 2 - 5 munits of enzyme(s) were' added to 10 ug of (glyco)peptide 
substrates (corresponding to approx 5 nmol of repeat units) in a total volume of 20 ul 
digestion buffer (see above). The reaction mixtures were incubated at 37°C and 2 ul were 



withdrawn after 3h or 24h and diluted 10 to 20fold in 0.1% aqueous TFA 'prior to MALDI 
mass spectrometry. 

Mouse dendritic cells (10 s ) were homogenized by fine-needle aspiration on ice using 1 ml of 
. 0.3 M sucrose, 0.01 M Hepes as buffer (without protease inhibitors). After dilution to 7 ml 
and centrifugation at 850 g for 10 min to remove intact cells, and nuclei, 6 ml of the 
' supernatant were centrifuged over 24 ml of 35% Percoll with 0.3 M sucrose, 0.01 M Hepes 
for 105 min at 20.000 rpm in a centrifuge (model J2-21 M/E, rotor: . JA-20, Beckman 
instruments, Miinchen, Germany) (15). The gradient was fractionated by gravity siphon (30 x 
1 ml) and each fraction was analysed after sonication for the- presence of MHC class II 
molecules by enzyme immunoassay with ahti-H2 antibody, p-hexosaminidase activity (15) 
and cathepsin L related proteolytic activity using TAP25 peptide as substrate (5 ug). The 
samples were incubated for 24h at 37°C, diluted 20fold in aqueous TFA and analysed by 
MALDI mass spectrometry. For specific inhibition of cathepsin L activity, the corresponding , 
fractions were mixed with 1 uM Z-Leu-Leu-Leu-fluoromethyl ketone (Sigma). 

Processing of MUC1 glycopeptides by mouse dendritic cells 

In a series of experiments a variety of application modes for the pulsing DCs with antigens 
were tested by using confocal laser microscopy and fluorescence activated cell sorting in the 
kineticanalysis of Cy3-labelled MUC1 peptide (lOOmer). According to quantitative FACScan 
data the proportion of fluorescently labelled cells increased with time up to 2h, but even more , 
with the mode of antigen application as free peptide, as peptide-antibody . complex, or as 
conjugates to paramagnetic ferric oxide beads. The latter were most effectively incorpprated 
by the cells irrespective of the bead size (50 nm or 1 um diameter) and irrespective of the 
mode of peptide conjugation to the beads via immobilized streptavidine, anti-MUCl antibody 
(C595, B27.29) or Helix pomatia agglutinin (in case of.GalNAc substituted glycopeptides). 
While more than 80% of mouse DCs were loaded with antigen-coated beads (1 um) after 2h 
incubation time, a' proportion of only 15 to 20% was reached with peptide-antibody 
complexes and below 5% with free peptide antigen. In accord with this, all attempts to 
identify peptide processing products after pulsing with . free . antigen or antigen-antibody 
complexes were unsuccessful. 
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Mouse DCs were adapted to AIMV medium and pulsed for 4h with a mixture of biotinylated 
glycopeptides (HI - H6) conjugated to antibody- and lectin-coated beads. Three fractions, the 
magnetically separated cell pellet, the centrifuged cell pellet and the supernatant were 
analysed for the presence of proteolytic fragments derived from the 21mer GalNAc-peptides 
(Fig. 1). Peptide isolation was performed by magnetic separation of streptavidine-coated 
beads after incubation with cell extracts or supernatant or by feversed^phase solid-phase 
extraction on Poros C18. The fractions were analysed by MALDI(tof) mass, spectrometry to 
obtain the mass pattern of the peptide products and by nanoflow LC-ESI mass spectrometry in 
the MS/MS mode to get sequence information. The results presented in Fig. 1 indicate 
extensive fragmentation of the glycopeptides, the supernatant containing mostly processed 
products, while the cellular fractions contained also considerable proportions of residual 
21mer glycopeptides. The AHG21 glycopeptides AHGVTSAPD(E)T(S)RPAPGS TAPP(A)A 
(substituted with one GalNAc residue) were identified at m/z 2249.0 and 2223.0, respectively, 
corresponding to the masses of N-thiopropionylated HI to H3 (m/z 2249.0) and H4 to H6 
(m/z 2223.0). The only products identified were registered at m/z 1695.7 (Plj and m/z 1669.7 
(P2), respectively, corresponding to the GalNAc containing peptide fragments SAP16. The 
sequence of the two peptide products (PI,- P2) were confirmed by MS/MS on a Qtof2 
instrument to comprize 16 aa long C-terminal portions of the AHG21 glycopeptides (Fig. 2), 1 

P 1 SAPDTRPAPGSTAPPA, (SEQ ID NO 7) and 

P2 SAPESRPAPGSTAPAA, (SEQ ID NO 8) 

both containing GalNAc at Thr / SerlO or Thrl7 (numbering according to the AHG21 
sequence). No SAP 16 peptides devoid of GalNAc were registered at m/z 1492 and 1466, 
respectively, indicating that proteolysis of AHG21 with. GalNAc at Thr5 adjacent to the 
cleavage site had not occurred. Sequencing of the remaining AHG21 glycopeptides by 
MS/MS analysis revealed that the GalNAc was preferentially located at Thr5, which is in 
accord with the assumption that undigested protease-resistant AHG21 glycopeptides were 
identical to HI and H4, respectively (Fig.. 2). The five aa N-terminal proteolytic fragment 
AHGVT (N-thiopropionylated) was riot detected in any of the spectra. Control experiments 
with DC primed AIMV media without antigen (4h) indicated that no proteolytic activity had 
been secreted into the. medium resulting in detectable endopeptidase cleavage of TAP25 • 
peptide after 24h incubation at 37 °C. However, after adjustment of the supernatant to • 
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conditions optimal, for cystein proteases (pH5.5, 1 mM dithiotreitol) minor exppeptidase 
cleavage of the peptide was registered in' the mass spectrum (data not shown). Hence the 
SAP 16 fragments detected in the supematants of antigen-pulsed cells can be regarded as 
cellular products and not as extracellular products of secreted proteases. 

i 

In vitro proteolysis of MUC1 glycopeptides with human cathepsin L 

To verify the processing data obtained with mouse DCs and to confirm the proposed identity 
of the preferentially involved protease(s) we- performed a series of in vitro digestions with 
cathepsin L and selected peptide substrates (Tab. 1, Fig. 3). Using standard conditions for. 
, cystein proteases and incubation times of 3h the enzyme was able to. cleave all non- 
glycosylated MUC1 repeat peptides quantitatively, except for the variant GST20-AES peptide 
(80% cleavage). . Irrespective of the starting amino acid position in the repeat sequence (TAP, 
AHG, GST) and of the length of the peptides (20mer> 21mer, 25mer, lOOmer), cathepsin L 
cleaved specifically .between Thr-Ser in the VTS A motif of . the repeat peptide. Besides this 
preferential cleavage site, which is in accord with the in vivo data, minor activities of the 
enzyme preparation were found to be directed to the adjacent positions Val-Thr (TSA17 at 
m/z 1959) andSer-Ala (APD15 at m/z 1771). To exclude the possibility that aminopeptidases 
could be responsible for the generation of these minor products, a protected substrate carrying 
a biotin label at the amino terminus was used as substrate (Fig. 3A). Also in agreement with 
DC-mediated processing Was the finding that O-glycosylated peptides, carrying GalNAc or 
Gal|31-3 GalNAc, were effectively digested (Tab. 1, Fig. 3B). Moreover, the positibn of. 
glycan attachment to one of the tree threonines (Thr5, ThrlO, Thrl7) in the AHG21 sequence 
was found to be critical as suggested by the in vivo data. While.GalNAc or Galp l-3GalNAc in 
positions more distant from the cleavage site (ThrlO, Thrl7) had no influence on the cleavage 
by cathepsin L, the glycopeptides HI and H4, both being glycosylated at the cleavage site 
(Thr5),.were stable to proteolysis.. Minor exopeptidase activity was detectable (in case of 
these glycopeptides) in the cathepsin L preparation from human liver. As a control, human 
cathepsin D was tested with a selected panel of MUC1 repeat peptides and glycopeptides and 
found to be unable to use any of these as a substrate, even if incubation times of up to 24h 
were chosen. It can be concluded that proteolytic activity in the human cathepsin L 
preparation recapitulated all major aspects of MUC1 glycopeptide processing in vivo by 
mouse DCs. - 
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/« vftp proteolysis of MUC1 glycopeptides with enzymes in low-density endosomal 
fractions from mouse dendritic cells 

■ 

Mouse dendritic cells were ruptured in the absence of protease inhibitors and the supernatant 
, after removal of nuclei was centrifuged in a Percoll gradient. The gradient fractions were 
tested for proteolytic activity using TAP25 as a substrate and incubation conditions optimized 
for cystein proteases (Fig. 4). Low density endosomes were separated from ' lysosomes 
according to the registration of marker proteins (3 -hexosaminidase) and demonstrated to 
contain a cystein protease inhibitable with Z-Leu-Leu-Leu-fluoromethyl ketone and with a 
site-specificity related to human cathepsin L. Only enzymes in fractions with a density around 
1.037 g/ml cleaved TAP25 peptide and H3/H6 glycopeptides at Thr-Ser yielding SAP 16, 
while all fractions, in particular those with densities above 1.G54 g/ml, contained considerable 
activities of carboxy peptidase(s) (Fig. 4). 

The presented work reveals insight for the first time into the processing of the human 
glycoprotein, tumor antigen MUC1 by DCs. Using state-of-the-art methodologies for the 
structural, characterization of peptides/glycopeptides this study was able to answer four 
important questions regarding MUC1 proteolysis by APCsin the MHC class. II pathway: 1) 
Which are the cleavage sites in the MUC1 repeat, peptide ? 2) In which way do O-linked 
glycans affect proteolytic cleavage ? 3) Are. core-type glycans removed prior to proteolytic 
processing ? 4) Which of the enzymes involved in the processing machinery are responsible 
for proteolytip cleavage of M1JC1 repeat peptides ? Our results suggest that MUC1 repeats 
are cleaved only at one site, between Thr-Ser in the VTSA motif During in vivo processing 
the core-type glycans GalNAc and Galpl-3-GalNAc were not removed, but inhibited the 
cleavage, if they were located adjacent to the cleavage site! The above aspects, in particular 
the site specificity of cleavage and the site-dependent effects of carbohydrates were perfectly 
simulated in vitro using human cathepsin L. Because this enzyme belongs to the cysteine 
protease family related to papain, and is claimed to. be involved in antigen processing (7,8), 
we further confirmed the involvement of cathepsin L (or a closely related enzyme species) in 
MUC1 repeat proteolysis by specific inhibition. It can be anticipated that antigen processing 
in late endosomes is mediated by a family of proteases with partially overlapping, but still 
distinct specificities. Hence, the in vitro data oh cathepsin L cleavage of MUC1 presented in 
this paper do not exclude the possible involvement of other, cathepsin L related enzymes in 
the processing machinery and in the specific cleavage of MUC1 . 



The peptide SAP20 or the glycosylated derivatives carrying O-linked glycans at the more C- 
terminal Thr/Ser positions may represent the preferential, if not exclusive processing products 
of MUC1 repeats in DCs. The structural features of these products match findings from an 
immunological study performed in parallel 1 . According to this work, glycans remain intact 
during processing of MUC1 glycopeptides by DCs, but influence activation of T cell 
hybridoma clones in. a site-specific manner. Clone VF5, reactive to a peptide epitope that 
comprizes the DTR motif, was activated by DCs pulsed with AHG21 glycopeptides which 
carried glycans at Serl6 or Thrl7. No. increase in IL-2 production by this clone was 
measurable, however, if the glycans were located at the proposed epitope or at the Thr/Ser 
positions adjacent to the cleavage site defined in the present study (Thr5, Ser6). Hence the O- 
linked glycans can alter site-specifically proteolytic processing or presentation of the MHC 
class H-restricted glycopeptide. That glycans negatively affect presentation of glycopeptides 
on MHC class II can not be generalized, since loading experiments demonstrated effective 
MHC class II presentation of glycopeptides and activation of T cells (16). There is evidence' 
that glycans have not only negative effects in the DC-mediated processing and T cell 
activation. Consistent with this, we have generated and characterized a T cell hybridoma VF9 
that is reactive to MHC class II-presented MUC1 glycopeptides which carriy Gal01-3GalNAc 
at Thr5 in the AHG21 sequence 1 . The specific epitope of this T cell hybridoma clone VF9 
implies that a 21meric glycopeptide passes the processing machinery uncleaved and is 
presented as such on MHC class II proteins. Thus, O-linked glycans. not only . survive the 
processing machinery of APCs and are presented on MHC class II proteins they can be also 
involved in the epitope, structure recognized by T cell receptors. Based on this it can be 
hypothetically generalized that glycans may contribute in this way to a greater diversification 
of the immune response to the redundant protein core of MUC 1 . 

i 

Tumor-associated MUC1, in particular the glycoforms • from breast cancer cells, have been 
claimed to exhibit underglycosylated protein cores (17), referring to both, to truncated chain 
lengths and to a reduced number of glycosylated sites per repeat peptide. We could recently 
show that this assumption's only partially correct, since the structural analysis of MUC1 
samples that were recombinantly expressed in four different breast cancer cell lines revealed 
increased O-glycbsylation densities (18). Moreover, the patterns of O-linked chains differed 
strongly between individual cell lines indicating that there is no common breast cancer- 



associated profile. Finally, the proposed shift from core2-based glycans to corel was not 
verified, on the contrary, core2 was demonstrated to form the prevalent structural basis of the 
cancer-associated glycans. An average profile of O-linked glycans determined for MUC1 
from pooled ascites samples of breast and pancreatic, cancer patients was also in accord with a 
more complex, core2-based glycosylation (19), Interestingly, this gLycoform of the mucin 
represents a weak immunogen in the MHC class I pathway (20) . and is.non-immunOgenic in 
the MHC class H pathway (14). The latter phenomenon has been assigned to an entrapment in 
early endosomes of DCs mediated by multivalent, high-avidity interaction with the mannose 
receptor (14). It can be concluded, accordingly, that O-glycosylation of MUCl interferes first 
of all with trafficking of endocytosed mucin. Later on, if late endosomal compartments are 
accessible for the antigen, other modes of interference mediated by O-linked glycans could 
also come into play, like asterical inhibition of proteolysis. The existence of such restrictions, 
introduced by site-specific O-glycosylation became evident in the present study, since glycans 
linked to Thr / Ser in the -VTSA motif of MUCl repeats prevented processing of the 
glycopeptides. The specific proteolysis of the MUCl peptide by cathepsin L would explain 
also the observation that a Tn-lOOmer peptide with a total of 15 GalNAc residues linked to 
each Thr5, Serl6 and Thr 17 (numbered according to the. AHG20 sequence) was unable to 
activate Th cell clones \ Hence the site-specificity of glycan substitution, not necessarily the 
glycan structure, has to be considered . in the design of cancer vaccines. Because not all 
glycosylated epitopes become available for MHC class II presentation, . SAP20 and its 
glycosylated derivatives represent a "pre-processed" form suitable for internal or' external 
loading on MHC class II molecules in immunotherapeutic approaches. In loading experiments 
with SAP20 the glycosylation-dependent effects on the binding to MHC class II proteins and 
to the T cell receptors can be studied by systematic variation, of the substitution sites and 
structures of the glycans. 
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Immunogenic MUC1 glycopeptides 



PATENT CLAIMS 

• 1. A peptide fragment of the tandem repeat domain of MUCl, starting with the amino 
acid sequence SAP and comprising at least 9 amino acids. 

2. The MUCl peptide fragment of claim 1, wherein the MUC1 fragment is in the range 
of 10 to 25 amino acids. 

< 3. A MUCl peptide fragment which comprises an amino acid of SEQ ID NO 1-4, or 
variants thereof, wherein said variants comprise one or more insertions, substitutions and/or 
deletions as compared to the sequence of SEQ ID No. 1-4, and wherein the biological activity 
is substantially equal to the activity of the peptide comprising the unmodified amino acid ' 
sequence of SEQ ID No. 1-4, and provided that said variants start with the amino acid 
sequence SAP and comprise at least 9 amino acids. 1 

4. The MUC1 peptide fragment of one or more of claims 1-3, wherein one or more of the 
threonines or serines of the peptide fragment are O-glycosylated. 

5. The MUC1 peptide fragment of claim 4, whiqh corresponds to one of SEQ ID No. 1-4, 
wherein the. amino acid is glycosylated at Thr 5 and/or 12. 

6. A nucleic acid, which encodes a MUCl peptide fragment of any of claim 1-3. 

7. A method of producing the MUCl peptide fragments of one or more of claims 1-5, 
comprising the following steps: 

a) providing a peptide comprising the tandem repeat domain of MUCl or a part thereof, 
which part at least contains one repeating unit of said tandem repeat domain of MUCl, 

b) contacting the peptide of a) with an effective amount of cathepsin-L or a closely 
related enzyme hereof, thereby cleaving the peptide, and 
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isolating the fragments produced in b). 

The method of. claim 7, wherein the peptide, provided- in step a) is natural MUC1 
derived from human milk fat membranes, from human tumor ascites or from human 
breast carcinoma cell lines or is represented by SEQ ID NO 5, 6, 9 or 10. 

The method of claim 7 or 8, wherein one or more of the amino acids of the peptide 
provided in step a) is 0-glycosylated, provided that the peptide is not glycpsylated at 
the cleaving site of cathepsin-L. 

The method of one or more, of claims 7-9, wherein one or more of the threonines or 
serines of the peptide fragment isolated in c) are O-glycosylated. 

A MUC1 peptide fragment, which is obtainable by a method of one or more of claims 
7-10. . - 

An ex vivo-method of producing, a population of autologous antigen presenting cells 
(APC's), which are capable of inducing effective immune responses against MUC1, 
comprising the steps of 

providing autologous APC's from a tumor patient, 

contacting the autologous APC's from the tumor patient with an effective amount of a 
MUC1 peptide fragment of one or more of claims 1-5 or claim 11 under conditions 
which allow ehdocytosis, processing and MHC class H presentation of the peptide 
fragments by said APC's, and 

isolating said MUC1 peptide presenting APC's for the purpose of immunotherapeutic 
application in the patient. . 

The method of claim 12, wherein the MUC1. peptide fragments in b) are bound to 
ferric oxide beads. 

An ex vivo-method of producing genetically engineered APC's, which are capable of 
inducing effective immune responses against MUC1, comprising the steps of 
providing a nucleic acid, which codes for one of the MUC1 peptide fragments of claim 
1-5 or 11, 



transfecting the APC's with said nucleic acid, and 

selecting APC's, which present said MUC1 peptide fragments in an MHC H restricted 
manner. 



The method of claim 14, wherein the nucleic acid is provided in an. expression vector 
in step a). 

An APC, which is obtainable by the method of one or more of claims 12-15. 
The APC of claim 1 6, which is a depdritio cell or a B cell ' 

A therapeutic composition, comprising a therapeutically effective amount of the MUC 
1 peptide fragment, of one or more of claims 1-5 or claim 1 1 or the APC's of one or 
more of claims 16 or 17 and a pharmaceutically acceptable carrier. 

The therapeutic composition of claim 1 8, which is a vaccine. 

Use of the therapeutic composition of claim 18 or 19 in the treatment of breast cancer 
or other MUC 1 -positive carcinomas including colorectal, pancreatic and gastric 
carcinomas. 
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SEQUENCE LISTING ' 
<110> * Cell Center Cologne GmbH 
<120> Immunogenic MUC1 glycopep tides " 
<130> P14655; 
<160> 10 

<170> Patentln version 3.1 

<210> 1 ■ 

■ <211> 20 

■ <212> PRT 

<213>- Artificial Sequence 

<220> . , 

.<22 3> SAP 20 



<400> 1 



£er- Ala ' Pr,o' Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala 
f l 3 * 10 15 



His Gly Val Thr 
20 



<210> 2 

<211> 20 

. <212> PRT 

<213> Artificial Sequence 

<220> " 

<223> SAP 2 0 



<400> 2 ; 



Ser Ala. Pro Glu Ser Arg Pro Ala Pro' Gly Ser Thr Ala Pro Ala Ala 
1 ' 5 10 15 



His Gly Val Thr 
20 



<210>' 3 ' . 

<211> 20 . 
<212> PRT 

<213> Artificial Sequence 

<220> ... 
<223> SAP 2 0 

<400> 3 

Ser Ala Pro Glu Ser Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala 
1 * .5 10 15 



His Gly Val Thr 



20 



. o 0 o-©« c 
o o o 
o c ooo 



CO" o^o 



4 

20 * 
PRT 

Artificial Sequence 



SAP 2 0 
<400> 4 

Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro /Via Ala 
1. ' ' 5 10 15 

His Gly Val Thr 
20 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 




<210> 5 
c211>- 21. 

><212> • PRT 

<213> Artificial Sequence 

<220> 1 

<223> H1-H3 

<400> 5 - ! 



Ala His . Gly Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser 
1 . * 5 . 10, ■ 15 

Thr Ala Pro Pro Ala 





20 


<210> 


6 


<211> 


21 


<212^ 


PRT 


^213> 

f 


Artificial 


<220> 




<223> 


H4-H6 


<400> 


6 



Ala His Gly Val Thr Ser; Ala Pro. Glu Ser Arg Pro Ala Pro Gly Ser 
1 5 10 15 



Thr Ala Pro Ala Ala 
20 



<210> 7 

<211> 16 

<212> PRT 

<213> Artificial Sequence 



<220>. 



Oo OOOO nA «f n oo 
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<223> PI . 
<4 00> 7 * 

Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala 
1 5 ' ' 10 15 




<210> 8 
<211> 16 

<212>* -PRT - 

<213> Artificial Sequence 

<220> ■ 1 

<223> P2 

<400> '8 

Ser Ala Pro Glu Ser Arg Pro Ala 'pro Gly Ser Thr Ala Pro Ala Ala 

1 ' • 5 10- " 15 



<210> 9 

,<211> 25 

<212> PRT ' s ' " 

<213> Artificial sequence 

<220> 

<223> TAP 2 5 
<400> 9 

i 

Thr Ala Pro Pro Ala His Gly Val Thr Ser Ala Pro Asp Thr Arg Pro 
1 • 5 10 15 



Ala Pro Gly Ser Thr Ala Pro Pro Ala 

20' 25 



io • - : 
20 
PRT 

Artificial sequence 1 
<220> 

<223> GST20-AES 4 
<400> 10 

Gly Ser Thr Ala Pro Ala Ala His Gly Val Thr Ser Ala Pro Glu Ser 
1 5 10 15 



<210> 
(F*ff <211>, 
V ^«^ L <212> 



Arg Pro Ala Pro 
20 



